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Abstract IU 

rae eee Te D 
This report investigates the problem of matching two 

different sketches of the same scene. The process of 

matching enables pairing of corresponding parts of the 

two sketches thus allowing registration of one sketch with 

respect to the other. Once registered, detection of 

matching subpatterns and/or discrepancies is possible. 


Sketches are represented as structured arrays of points 
(Cartesian coordinates and other associated information 
which embody the most important features of the sketch). 


Two sketches are matched by comparing all point pairs 
of one sketch with those of the other. The decision as to 
whether or not one point pair (fuzzily) matches another is 
based on how well the length and slope of the imaginary line 
segment between the first pair coincides with the length and 
slope of the second pair. «~— 


The power of the matching process is tested by adding 
varying amounts of noise--in the form of Brownian motion, 
rotation, rescaling, random point additions/deletions~-to 
determine how fast successful performance degrades. 


Several means of reducing/combatting the effects of 
noise are considered. The two most important ideas are 
adding additional information about each point and attempt- 
ing to determine the rotation/scaling of one sketch with 
respect to the other. es for further research 


are also included. Y 
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The goal of the research underlying this report is 
to develop a basis for additional research in the area of 
map-guided segmentation and interpretation. That is, given 
an appropriately represented map of some arbitrary area, one 
can use that information to aid in the seymentation and 
interpretation of, for example, an aerial photograph of the 
same area. 

Aftec corresponding parts of the map and photograph 
have been determined, the two images may be 
"“differenced";that is, matching Subpatterns may be factored 
out, leaving only the dirferences (objects, changes, etc.). 
These discrepancies could then be “interpreted" by some 
intelligent process. Appropriate interpretation could 
reveal such things as tumors, dental catcies, or military 
targets. 

Specifically, this report investigates the probles 
of point pattern matching. The map, photograph, etc. is 


represented as a list of points ( Cartesian coordinates and 


other associated information ) which embody the sost 


important features of the image. This information is 
collectively referred to as a sketch. 

Chapter 1 introduces the topic of sketch satching 
and provides appropriate background infocmation. Also 
presented is a review of related literature. 

Chapter 2 describes the basic, brute force satching 
algorithar and its aotivation. A simple example illustrates 
the various steps involved in the process of satchinog. 

Chapter 3 relates the results of the saatching 
algoritha in several test cases. First, sketches of tuo 
different ( types of ) pictures, a tank anda Washington 
D.C. area road map, ace matched. Second, the road sap 
sketch is matched to copies of itself to which varying types 
and agounts of noise have been added. 

Chapter 4 describes the results of two methods used 
to reduce the effects of noise. 


Chapter 5 summarizes the research and presents some 


Suggestions for further work. 
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CHAPTER 1 
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INTRODUCTION 


Tha goal of the research described herein is to 
build a fcundation for a map-guided segmentation and 
interpretation system to be developed by the University of 
Maryland*s Computer Vision Lab ( CVL). 

The basic, underlying fhilosophy of the proposed 
system is the following. If one has a map of appropriate 
resolution and information content and scme method{s) of 
seqmenting that mag into its constituent parts, then one can 
use those results to aid in the segmentation and 
interpretation of an aerial photocraph, for example, of the 
same area. as described by the mar. 

While the map and photografh are certain to contain 
many differences, there should te a significant nuaber of 
features ( call them subpatterns ) common to both. For 
example, rivers, lakes, major road systems, etc. ( call then 
subpatterns ) shculd arfrear on both images. These 
corresponding subpatterns could be put into registration by 
a matching process. 

Once registered, the matching subpatterns could be 
factored out of one or bcth images, as appropriate, leaving 
only the discrepancies, differences, changes, and so on. 
These would finally be examined by some “intelligent 


process having access to a data tase containing information 


necessary to interpret such remnants, The development of 
such a matching system from a Simple, basic algorithsm to a 


more powerful one is the topic of this thesis. 
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1.1. Domain of research 


Much work has been done on image region matching for 
such purfoses as stezeomapping, change detection, etc. 
While this approach is very important in many applications, 
it cannot tolerate much gecmetrical distortion. For that 
reason the author's research has dealt with the problem of 
sketch matching. 

A sketch, in general, is a line drawing which 
represents significant region boundaries, lines, curves, and 
points in a scene. Fcr simplicity, however, a sketch will 
be cefined as that set of points ( point pattern ) which 
represents the most important features of a sketch such as 
line/curve intersection(s), curvature maxima, etc. A pore 
formal definition can te found in Section 2.1. 

The remainder cf this charter describes the relevant 
point-pattern matching literature. All algorithas, 
including the author's, deal with patterns in 


two-dimensicnal sface. 


Simon, Checroun, and Roche [S1] discuss a_ process 


for recognizing patterns of N- points. The recognition 


process 
obtained 


N (N-1) 


is based on the comparison of the "ordounances" 

by ordering tne Euclidean distances between the 

7 2 point pairs of the given pattern. 

Their algorithm is as follows: 

(1) Compute the N (N-1) / 2. point-pair distances, 
d(i), for each pattern. 

(2) Index sort each list of d(i)'s into descending 
order. 

(3) Using the indices generated in step (2), 
respectively, order the point-pair label list 
for each pattern. 

(4) Compute from the lists or step (3) new lists in 
"“rarenthesized" notation. Assuming each 
distance was computed with some error, epsilon, 
then perhaps some members of the distance lists 
(and hence label lists) may have been permuted. 
Given this assumption, the parenthesized list 
indicates which label list uembers' order may be 
pecmuted in step (5). 

(5) Select some lavel, (point) A (point) B, for 


example, from one list; say it occurs as_ the 


first member of the list. Next, locate ( the 


positions of ) the N-2 other occurrences of 
point A (or B). Select, from the other list, 
those labels at the same possitions as point A. 
If some point, A*, for example, occurs in all 
selected labels, then A may be paired with A*. 
Determine any other possible pairings for A in 
ithe event of permutable iabels. 

‘sy Wenewk Step (5) for any remaining points until 
all have been successfully or unsuccessfully 
paired. 

The Simon/Checkroun/Roche algorithm results in one 

of three possible outcomes: 

(1) impossible - one or more points have not been 
paired, 

(2) possible - all points are uniquely paired, 

(3) undetermined - one or more points have aultiple 
“possible pairings. 

While the ideas underlying the matching are 
intuitively sound, the process itself has several 
weaknesses. First, only patterns containing equal numbers 
of points may be matched. Second, declaring a match 
“impossible” may be incorrect as one pattern could be a 
subset of the other. Third, no disambiguating process such 
as or Similar to relaxation (see w#osenfeld (R1]) is 


incorporated ia an attempt to reduce the nuaber of 


"undetermined" cases. 


1.3. Matching by templates 


Zahn [21] describes an algorithm for detecting 
whether or not cne planar point set is a fragment of another 
which has possibly undergone scme arbitrary similarity 


transform and has had noise added. The algoritha is built 


on the sivilarity invariant aspect of the local structure of 
the minisal spanning tree ( MST ). 
Zahn's algorithm is as fcllows: 
(1) Compute the MST for fragment (f) and base (b). 
(2) For each ncde of f and b compute: 
(a) degree - numter of connections to other 
nodes via MST 
(b) if degree >= 2, coapute sinimua 
| suktended angle ( MSA ) by edge 
connections to other nodes, and 

| (c) length ratio ( LR) of the sinizua 

4 angle. 

(3) Atrange nodes into separate lists hased on 
degree and scrt each list on increasing value of 
MSA. 

(4) Scan through the fragment node lists until a 
“simple match" on scme node yields a good 
template match or fragrwent is exhausted with no 


match. That is, for each fragment node f of 


degree D, search base node lists, in order D, 


D+1, C-1, for matching MSA's and LB's’ sodulo 
scme. errcr tolerance. 

(5) If a simple match check is successful for some 
pair cf nodes ({ f, t ), test for a “local-fit"; 
check geometric consistencies of MSA's ( angle 
registraticn ) and L§&*s ( scale factors ) of 

+ respective neightorhcods of f and b. 

(6) If just one MSA/LR match is’ successful, then 
try a “global-fit". Sclving an appropriate 
linear system of equations provides coefficients 
for a similarity transform from base to fragment 
space. The transfcrm is then applied to all 
base points and each fragment point is mapped to 
its nearest base feint neighbor in the new 
space. If a sufficiently large proportion of 
frequent points match, modulo some tolerance, 
then the match is considered successful and 
searching terminates. 

Zahn's algorithm works fairly welljit is rather 
tolerant of scale, rotation, and translation between point 
patterns. However, it does not handle noise of the 
Brownian-mwotion type very well. Also, certain combinations 
of insignficant random additicns/deletions could 
sufficiently alter the MST sc as to be unrecognizable to his 


algoritha. 


1.4. Similarity relations and autocorrelation 


Seidl [S1] discusses a similarity relation based on 
mutual nearest neighbor conditions and a new (&-1)th order 
autocorrelation method for determining whether an & point 
pattern is Similar, in some way, to an N point pattern. 

Since Seidl presents no explicit algorithms per se, 
suffice it to say that he determines whether some subset of 
one point pattern matches some subset of another pattern if 
there exists a transformation under which corresponding 
points are mutual nearest neighbors. Seidl also considers 
autocorrelation similarity, that is, determining whether or 
not the chord sets - N * (N - 1) / 2. point pairs - are 


mutual nearest neighbors similar according to his exteaded 


definition of similarity. 
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CHAPTER 2 


BASIC ALGOKITHM 


The major concern of this chapter is iescribing the 
basic, brute torce matching algoritnm and all necessary 
terms, notations, etc. Although simplistic, in cases of 
reasonably similar Sketches, fairly good results are 
obtainable. AS will be seen, however, not much noise or 
distortion is tolerated. Later Chapters describe the 
effects of noise in detail and present some methods to 
reduce those erfects. 

Section 2.1 defines the terminology and notations 
which will be usei throughout this thesis. An informal, 


step-by-step description of the basic algorithm ( PTM/BA ) 


is presented in Section 2.2. Section 2.3 carries out a 


brief, but complete matching on a simple example. 


etree 
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2.1. Terminology and notations 


Contained in this section are definitions of the 
terms, notations, data structures, etc. used throughout the 
remainder of the thesis. 

A sketch is defined as a_ set, { P(i) { i = 1,N }, 
where N is the number of foints ina sketch. 

Each point , P(i), is represented by a three-tuple, 
{ x(i), yi), jt(i) ), where x(i) and y(i) are the X and Y 
Cartesian coordinates and jt(i) indicates the junction type 
( see Appendix A ) similar to those defined by Waltz { #1). 
NOTE: all junction types are ignored in PIM/BA. 

The "Interpoint Line Segment (_ILS_)" structure is 
an acray of the following components: 

(1) address ( into point set structure ) of first 

point of pair ( PT1), 

(2) address of second point ( PT2), 

(3) length of the imaginary line segment connecting 

the two points ( LENGTH ), 

(4) slope of that segment expressed in radians - 

range [ +PI, -PI ]} ( THETA ), and 

(5) flag indicating whether or not the point pair 

has a possible match or not ( FLAG ). 

The "Cartesian Coordinate Difference (_CCD_)" matrix 
is essentially a two-dimentional histograga. On algoritha 


tergpination it contains, in the case of matching point 


er nee 
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pairs, the positive/negative X and Y coordinate differences 
{ translation factors ) which, it added to the first point 
of the first pair, would translate that point so as to 
coincide with the first point ot the second pair. A 
singular cluster of significant concentrations indicates a 
successful sketch matching. 

The "kotational Histogram (_RH_)" vector contains 
the freguencies of occurrence of relative rotations ( THETA 
- THETA' ) between the line segments determined by possibly 
Batching point pairs. 


"Mismatch tolerance (_TOLK )" limits how imprecisely 


——— SS SS eee 1 See 


point pairs may fail to match before being rejected. 
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2.2. Qverview of basic algoritha 


Described below is the basic, brute force version of 
the matching algorithm ( PIM/BA ). Chapter 4& describes 
various enhancements to this version which reduce the 
Cunning tige, improve nadgise tolerance, and attempt to 
determine relative scale and rotation between sketches. 

PTM/BA consists of the following steps: 

(1) Input values for the SKETCH! set, 

{ Pl(i) t i = 1,N 3} , and the SKETCH2 set, 


€ P2(3) 1 3 


(2) Input the pismatch tolerance, TOLR. 


1,4 }. 


(3) Compute the lengths ( LENGTH ) and slopes 
( THETA ) of the ipaginary line segments defined 
by the N (N-1) / 2 point pairs of SKETCH! and 
the M (M-1) / 2 point pairs of SKETCH2. Store 
the results in the ILS structure. 

(4) Attempt to match each point pair of SKETCH! to 
each peint pair of SKETCH2 as follows. 
Referring to Pigure 1, consider the point pair 
( PI(1), P1(2) j and the pair { P2(1"), 
P2(2') Jj. Let the imaginary line segment A be 


considered one side of an imaginary triangle and 


Pigure 1. Imaginary line segment defined by a point pair. 


Figure 2. Imaginary triangle defined by two point 


EEE ROT Rr . - 


segment A* a second side as in Figure 2. 
Compute the length of the third side, A", by the 
fermula: 


| 
] 
| 
; 
| 
SQRT( A®*2 + At#E2 + 
| 
2#A*A'*CCS( A.SLOPE - A*.SLOPE )). | 


If A“.LENGTH < MIN( A.LENGTH, A‘*.LENGTH ) *TOLR, 


then A is a possible satch for A* so: 


(a) Ccmpute the translation factors between 


P1(1) and P2(1"), that is, P1.«(1) - 


P2.x(1") and El.y(2) - P2.y(1"), and 


(b) Use these factors to index into CCD and 


increment that element by one(1). 


(5) Display the CCD matrix. 


If SKETCH1 is reasonably similar to SKETCH2, then a 


cluster of significant concentrations should appear at those 


translaticn factors which wculd map similar parts onto one 


another; otherwise, no such cluster will exist. The above 


algorithm is illustrated in the next section by considering 


a simple example. 
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2.3. A Simple example 


The Cartesian coordinates of the two sketches to be 


| weutinteienae SS aay i IN pat ages 
| SKETCH1 {i SKETCH2 { 
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Table 1. SKETCH! and SKETCH2 point-data. 


used in the simple example are listed in Table 1. Figure 
3 displays the topographical structure of the sketches. The 
reader will note that SKETCH! is atranslated version of 
SKETCH2 with the uppermost and uppermost-lettmost points 
deleted. 
Beginning with step (3) of PTM/BA, the following 
iterations would occur: 
(1) First, [{ P1(1), P1(2) ] ot SKETCH? is matched 
against [ P2(1"), P2(2') Je [ P2(1"%), P2(3") ). 
( £2(4*), P2(5") }] of SKETCH2 as defined 


° . ° ¢ L 


in step (4) of PTM/BA( Section 2.2). The only 


Pigure 3. Graphic display of SKETCH! and SKETCH2. 
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pair satisfying the match criteria is 
{ P2(3"'), F2(4") j as illustrated in Pigure 4, 
ccD( 5, 5 ) is incremented by one(1). 
Second, [ F1(1), P1(3) ]} of SKETCH1 is matched 
against [ F2(1*}), P2¢2%) i, [ BP2(1*), P23") Lb 
oe «© « [ P2(4"), F2(5") ] of SKETCH2. Pairs 
[ P2(2"), B2(4") J and [ P2(3"), P2(5") } both 
satisfy the match criteria as shown in Figure 5. 
ccpy 16, 5.) and CCC( 5, 5) are both 
incremented. 


Lastly, [ §E1(2), F1(3) ]) of SKETCH1 is matched 


against [ PZ(1"), P2(2*) J]. Figure 6 shows that 


pairs [ P2(2"), P2(3") } and { P2(4*"), P2(5*) Jj 
satisfy the match criteria. ccD( 5, 0) and 

ccD( 5, 5 ) are incremented. 
Display of the CCD matrix, Figure 7, reveals a 
"high" concentration at cCCD( 5, 5) which indicates that 
SKETCH1 is similar to SKEICH2 and also defines the relative 
translaticn factors between the sketches. The 
"concentrations" at CCDE€ 1, 5S ) and cCCD( 5, 0) are 


extraneous. 
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Figuce 4. Possible matches for point pair 1-2. 


Figure 4. Possible matches for point pair 1-2. 
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Figure 5. Possible matches tor point pair 1-3. 
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Figure 6. Possible matches for point pairc 2-3. 
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2.4. Implementation details 


The author decided to implement the matching 
algorithms ( PIM/BA-EA ) in Bell Labs' "c" programming 
language for several reasons. First, since C is 
tailored/tuned to the architecture of a PDP 11 
minicomputer - CVL has a PDP 11/45 - and was developed as a 
high-level systems prograssing language, fairly efficient 
code is produced by the ccapiler. Second, numerous, 
easy-to-use system routines make input/output (I/0 ) to the 
user's terminal, line frinter, files, etc. relatively 
painless. Last, as a result of a fairly strict language 
aeutax, the C compiler detects many errors which vould 
normally not be found until run-tise in other languages. 


PIM/EA-EA is very modular and structured in nature. 


It consists of a main routine and approximately thirty 
supporting subroutines abcut ten of which are systea 

‘ routines or have been written by other members of CVL. The 
object module occupies about Zz kilobytes of PDP 11/745 
storage; code/data split is apprcximately 70% /30%. 

Some of PTM/BA-EA'S more noteworthy features are the 
following. Sketch data is inputted from user specified UNIX 
files( NOTE: UNIX is Bell Labs* PDP 11 operating system ). 
This feature cbvicusly saves the user a great deal of 
typing. To give the user an intuitive feeling of how 


similar two sketches are, a representation of each is 


2 
at be » 
Pe oa ye 
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“drawn” on a Grinnell color display system. After all point 
pairs have been matched, all point pairs which have 
potential matches in the other sketch are connected by lines 
on the Grinnell display. 

If the user desires, a debug display is produced as 
follows. When the I'th foint pair( the "stationary" pair ) 
of one sketch is being matched against all point pairs( the 
"moving" pairs ) of the other sketch a line is drawn to 
connect the stationary pair( SP ). As SP is matchea against 
one particular moving pairc( MP ), a line is dceawn to connect 
the MP. If SP potentially matched MP, the line connecting 
MP is leftj;otherwise, the line is "erased". If SP had at 
least one potential match to one of the MP's, its connecting 
line remains when new SP is selectea;otherwise, the line is 
erased. 

At some point some process would have to decide 
which point pairs of one sketch “actually" matched which 
point pairs of the other. To simply this process, PTM/EA 
creates a ( UNIX ) output file which coatains a list of all 
potentially matching point pairs. The List is in printable 
form for purposes of display. 

Hardcopy output from PTM/EA consists of a 
binary-valued display of the sketches( optional ), CCD 
matrix, and rotational histogran. After this hardcopy 
output has been printed, the user way tetminate the prograa 


at which time he can print the list of potentially matching 
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point pairs. The other is choice is continue the prograr 
and match another pair of sketches. 
Recalling that PTS/EA is Order(N**4), matching two 


twenty-point sketches ( 160,000 point pairs ) requires about 


twenty seconds of real-time with an average load on the PDP 


11/45. If the Grinnell debug display is done, about five 


RBinutes - a fifteen fold increase - of real-tigae is 


required. 


eRe Rey SO 


CHAPTER 3 


EXPERIMENTAL RESULTS 


The performance of algorithm PTM/BA is presented and 
analyzed in this chapter. To illustrate the power of the 
algorithm, expected test case results are compared with 
those actuaily obtained. 

Section 3.1 detines how feature points are selected, 
represented, and stored by decribing those functions in 
terms of two test images while Section 3.2 considers the 
determination of a suitable value for the mismatch 
tolerance. The main result of the thesis, Section 3.3, 
presents and analyzes the test case result for the two 
images. Noise effects are also examined. The last Section, 
3.4, briefly describes the results of matching random point 


patterns. 
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3.1. Selecting feature points 


Since no map/photograph database was available, it 
was approximated in the following manner. Two imayes, one, 
a picture of an army tank, the other a Washington, D.C. 
area coad map, were used to generate the test sketches. 

Two people were given a photocopy of both isages. 
Bach was told to select ( by circling ) the twenty or fever 
feature points which best characterized the images. The 
selected points were then given X and Y coordinates by 
assigning each to a unique block of an overlaid transparent, 
coordinate grid. In the case of the map, the points vere 
also assigned junction type labels similar to those defined 
by Waltz [W1]. Each pcint set was then entered into its own 
( PDP/11 UNIX ) file. 

Intuitively, this should give a fairly close 
approximation to the output of some segmentation alyoritha 
which had processed the map and aerial photograph. Figure 


6 and Figure 9 show the imayes and selected feature points. | 


Appendix B contains pictures and data-point files 
for all sketches used to test and analyze algorithas PTN/BA 


and PTM/EA. 
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(a) TAMKI 


(b) TAWK2 


Pigure 8. TANK! and TANK2 with designated feature poiats. 
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Figure 9a. MAP1 with designated feature points. 
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Figure 9b. SAP2 with designated feature poiats. 


s 


Taple 2a shows that the two sketches of tne tank 
have tive points in common and six near-misses - a Situation 
in which one oc both coordinates of one point differ by oniy 
+-1 from the correspondiny coordinates of a point in the 
other sketch. Table 9b shows the map sketches to have five 
common points and five near-isses. Matching either the 
tank sketches or the map sketches will prouuce a 
concentration of ten - 5 * (5-1) / 2 - at CCD( 0, 9 ); 


see Figure 10a and Figure lla. in these two cases, a 


mismatch tolerance of 0.00 (0% of grid size ) was used. if 


the tolerance 1S increased to 0.05, some of the near-misses 
contribute to the cluster of concentrations centered at 
CCD{ 0, 0). Greater tolerances, however, allow for very 
sloppy matches to be accepted; see Fiyure 10c-d and Figure 
1ic-d. Too large a response is undesirable from the 
Standpoint of haviny to disambiguate too many multiple 
possible matchings via, for example, relaxation. 

Evaluation of Fiyure 10 and Figure 11 ,in light of 
the number of common feints and near-misses, strongiy 
Suggests that 0.05 is a suitaple value for the mismatch 


tolerance and was, in tact, tre value used. 
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Tavle 2a. TANKI/TANK2 point-data. 
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Pigure 10c. TANK! vs. TANK2 with mismatch tolerance 0.10. 


i 
DP eee © eng | 


ceoocoocooecoccoococooocoocoocococoyoococooocoocoocoococoocoo0ccoooococo 
CODDDOCDODSDOCOODCDCDOD OCOD ODO OO OOODOCOOCOOOCOoODO DOGO ON0N000 
eoCooCDDDOCODODCOCOOOCOCOOOOC OOO QOD ODOC OODDDODOOOBOCBOCD00 
foMomowoNoRoReononoNohoReonononononoRonon onohononononononononononcoeononconoolonoReoRolonononohononononeone) 
cooooooco9eCcoocCCGCcCCCCOCOCCOCCCOCCCCCCCCCCCCCCCCOCCCCCCOCOCOCSNO 
ccoooooocooccocsocoCCcCCCoCCCoCCcCOCOCCCCCcoCocoocoocooCcCcoocoCc0OCoCOo°o 
cocooococeccocCoscCCcCcCCcsCoccoocoCCoOoOHnCcoceocococcocococoaoooco 
eo0o0oo00ccc0CCCCCCCCCCCCCCCCCCCCCCSCCCCCCCCCCCCCCCCCOCOCOO 
ecoooooceocooccoocoocooocooonmoocoocoocoo0cocoococococoCCcCoCcacoocos 
sooo0ooo0oo0oocooc0oqo0c0ccocoocooococooocoocococooococoocococo0CcCcCooCcCcooocoe 
ceooo0ococooccoCcoococCcCCCcOCOOoOOnxNooo0c0ooCoocacoccooccCoocococoooocoocsno 
ceocooocoo0ooococoococcocococococoococooococcoocooxnxoococococooococoecoo 
coocooocoooocooeccoocoococCcono00Cceonx~gogoc0cococecoocooccococoocooooooccone 
ceoooo0oocoooco0oono0c0coooccococSecoCcocoCc0cCcCONmMmonoooooCcoCCocccond 
cooooooo0oo0ocoo0ooco0oonmcococemMmMooccoococoocooocoococoocoooocoocoo 
coooo0oo0o0oooc0Cc0oCcoxnwx4nOGOnCoo0c0coccocCcCcomrnocoooocococoocooocoocooosno 


oooo0ooccoo0coxnGCoocCc0ccCoCcCCoCoOnccCcCcoonaoacoocoococoCcCcCCCoCCcoCocNo 


ceooococoocooococooccoonoococooonownaocooosaooc00cCcCco0c0Cocococ“ono 
ocoocoocoooco0ocoocooocoooccocooocooo0oonwoooccocooomocooocoocoocoooca 


ooo0o00ccocCcOoONnOooCCCoCOSCTOOCOCCOONNOOn~ODODCOOCCOKCOOCCCOOCOoOCOKCOoCoCOo 


CCOCOCOKCOCCOCOCoCOOO OKO COC On DOOOOOHMDOCCOOCOOCOOCRDOGOOO 
COSGOCODDODOODSCOHRODOONOOO DODO OOOOONGOOOCOMODDDODODOD0S 
ceoo000c00cc90c00cCC0C0C Ccoccocc9c0oOnqgoooqoooCConO00CCCCCOCOCCCCCCoCCCSo 
ceoooo0ooc0ocococo90omnocceoocoxnoGCnwooonOoOnoocoaccooonmooogo000coNd 
ocoocoocoocoocoococococo0ccomrtococococccccaomoonccooccoCCCC0CCScCS 
coooo0oo00oc0o0oc00c0g0ccooccccocCoCCCCCOCCOHOCCCONMoooCCOCCCCCCCSCCCS 
coooo0oc0o0co0coCcC0c00cCcoONoonxzcCOoOCoOCCOCCCOCCCCCOCCCOCOCCCCCCCCOCCOCSO 
coooo0oo0o0o0oo0oc0cogococococccconmoccccococoocoocoHnocooocoo0c0ococean 
ococooocoocoo0ocococeococ0ecccoococcoccc9oOoOHO0ONnCOOOCOCCGCCOCCOCOCOCOOSG 
ooooooooo0ooo000o0on0c000ceoocococoococoocococooscococoooococoocooc c0oc“ecea 
coooooco0oococoococoococooco0o0cso -oOoo0oM~ooco0cocoo0ccocoococococCocoCcCooca 
oooo0cocoocoocooCcCcoCoCcVeCocCcCCCCCCVCCoOKwOOCCCCoCcocCcoccoocooooocooocne 
coooo000cCco0ocococoococococooocoococcoocoococco9cCCCCCcCoccooccoCCCoCOoCCcSa 
ococoooo0oocooocooocococoococonmoonw~oooococoococccocCccocoocoece 
ecocoo0o0cc90Co900000CCCoCCOCCCCCCCCCCCCCCCCCCCCCCCCO CCC oOoOCSo 
ocooo0oo0oocooocoococococoscoooccocoooccoococeccocooocoo0occo0ccoococ“osc 
ooooooocooocooocococococoococeccooCCoCCCCcCoCcCoOCCCcCoOCOCCCCCCoOOoNo 
ecoo0o00c00cocococcoococoococcCCCCCCCCCCCCCCoOCcoCocoocoCoCCCoOCoCooSo 
secooo0oo0o0oc0cccCccoccccoccCCocOoOCCOCOCcCOoOuCOCOCOCOCOOCCOCCOCCCCOCOCCCOCOOCOSCoCOoS 
COOKTDDDOODDODODOOC OOO ODO OOOO COOOOOOOOOOCOOOCOOOCO OOOO 
scooooocc0oco0ccococoeccoccococccooc0oCCCOCOmoococoococooccooococoCcCCCCCCOCoOSs 


TANK2 with mismatch tolerance 0.15. 


TANK1 vse 


Figure 10d. 


loRomoBonoRoRoneBoBRoneRoRoRoRoRopoRozoneononononone) <oRekoRololololeololonohonohononohononohononohonoh-) 
escoococoococococoocc0ccccocooccoCCoCCCCCCCCCCOCCOCOCCeCCOCOCCCCCOSCCCSO 
coocooccoooc0ecoccoscocococCcCcCCoCCcoC9CCCCNCCCCCCCCCOCCOCCOCCOSCSCSCSCO 
lo RomoBRoBoBoBoBonoeonoRoRonoRoRogonononeononononowonon cou oRokeoRonoololcololeohofonohonononononohononone) 
lo RomoRoBomoRoReowowonoRoRokoR. BeokokosoncnonoeonononononosoRokonononelehonoholofonolohononohononohonone) 
Ssooo0coccCo9CCCoCCCCCCCCOCCOCOCCCOCCCCCOoOCOoOOCOCCCOCACCCOCCOOSO 
cooeoeocoocococo9o0CccoccoCcoCoCoCCCCCCCCCCCCOCOoOCOCOCCOOoOOCACCOOCDOCOOoO 
boRomoRoRoRoReoBonoRoBoRokononoscoeoRowokoRokouonononokviohoReononeoioneononolelelolololohonhehohohehononone) 
coooooo0oc0cc0c0cccococcocooccoCcCCCoCCCCCCCCCCocoOCcCCCCOCCCO0O 
omomoRomomoRoneomconononononohosoeonowononosononononon BokcokononononolonoLeoleleolonololoheolonohohononone! 
ogoooocoooCCoOCOOCOCoCOCOCOOOCOO-CCoCOCCOCSCSGCOCOSCCOOCOCOCOOOCSO 
csooooecococoococococ0cocooxococcoosccoocooCcCoCCoCcCoOcCCCCCOCOCCSD 
coooococoooocccocococococoCooOHC0CCOCOCOCCCCoOCOCOCCCCCCOCCCCOOSSO 
oRoRoRoMoRoRoRoRoRoNoNehe RoR Ro NohoNeReNoRoNeRoM Rolo RokoRolololoReohoNoloheh-helcfof-f-f-f-h-ohe h-hh) 
cssocooo0oo0o00occc0ccc0c0cccCcCcCCCCCCCcCCCCCCCOcCCCCOCCACCCCOCOOOCO 
oRomom- Bo moRononoBonononon Relohononononod Bonokonononononononononolonelel Leoleololohelolololohehonone) 
cooooccocoo0c0c09o0cccocCCCCONMNOCVCVCNOSCCOHOCCKCCCKDAOCGAOCGCCACCACCCSCOSO 


esoooecooco000c00cCccCocConCOCCCCCSCCOSSoOSCSCOCOCOGCOHn~SCCOCoOGOOOOCOCOO 


U 
cooooceooc0oc0occ0ococco9cccccocoocCccoCcCocCCcSocOoOoOcCCOOOCOCOCOCCCCCOCCO SOO 


SCHODDDDCDNDNDCCCCOHHOCOCOOCOHODOOCGCOCOnAACDCODODOOOKBDOCOCOOOOD 
SoOSCSCOOODCOCONOCOCOCOCCOHOCODOKHOCOCOSCCODOODOOOOROCOOOOCCND 
ScagcceooC OOOO COC ONDOO OCS OHCOOCO DDC OC OOO CORO ODOOODSD 
SSCOCCKCOODCOCOCNDCOCOMDODCOCOCOCCOOHxHxCOKDOAOOOOODOOOOCOO000 
SCOSCNDSCKCOOOCOOOCODOOOGCOCOOSCDOSCODUNOOCOSONCOOOCOSCOSOCOSCOSNDS 
SCKOSCSCSOCOOSCOCOCONDOOOCOH~CHOCDACGOGOCODDOOCOOCCOOCOCOOOOSO 
eccOoOncoCOCeCOScCOCONON oRokomeohonoMoReRo Role M Boloheoholeololoneholohoneole) 
loReoRoRoRezonoRoloRoheRoloveE Rololone) SseoeoseocoocoeCCCoOCOCOCCCCSCOS 
SCCOOKCKCKOKCCOOSCaGoOCKOOOOoOoOCODO OOOOH OCOOOCOCOCOGOCCCOGOSS 
CSCODNCCOCOCSCOODOSCoOHOOSCOCCOCOOOGOCOCOCODOCOOOCOCCOOCSSO 
SOO0SKCOODOCOOCOCSCOSOCOOCCCOONSCODOSCCOOCOSCOSCCOOOOSCSS 
SSS0CS0CCOCOOOHCOGDOOA0ONDDODSCCOCOSoCGOSCOOGOGOCCCOCOSSCS 
cseosocoocccocoeocscxH~ocoooccooOoOccCoCOOSOCOCOCSCDOOOSSOSCSCOS 
cacocseo00ococH00 coceococ0oscocoscooocooocoscDOSN 
ecoo00eccocaanD eoceooeoecoscooonoSCoCoOCOOSOOOCECcO 
ecocoo0cocoocce Ceo0ceDoeCoDAOOCOOCOOCOOCOSCCOCaACCSSO 
oceooooscccse coceuooococcoocccocaeoSsCOOOSCOCSO 
CGDOSSISCOCOCOOS: COOOKnS OOOO COO SC OC OOCOODODOODOHOSCO 
-OCGSOSOCOSS oecooceoocooepoSsSODOCCCOCOCSCOD 
aSeocsooncsoce oscoeosococooOsNSONAGCCOOSCSODSCSOO 
GOSGsoCoNOCC OOK. CeGoncseeccsooscooséoo0o00socoonoosn 
scooscoscsoon: cececcosa9090CcCOScCDOCeCOOSS 


090 
lemme) 


900000 
a0 


5,oO 6 ¢0 600 000 0:0 0 
oa0 
a 
fo) 
fe) 


00000000000 


900000 
ao 
aooodoe 
0o0aqgo0o0on 
0°90 


cs 
GS 
o 
LY 
o 
°o 
c 
o 
Cc 
oS 
o 
Go 
Ss 
cS 


9000000000 


0 
Oo © 
G 


cv) 
060 


a ¢ 
0 


Figure lla. MAP1 vs MAP2 with tolerance 0.00. 
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Figure 11b. MAP1 vs MAP2 with tolerance 0.05. 
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Figure 11c. MAP1 vs MAP2 with tolerance 0.10. 
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Figure 11d. MAP1 vs MAP2 with tolerance 0.15. 
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To determine the robustness of the matching 
alyoritha, varyiny amounts of ditferent types of noise were 
added to copies of the MAP2 sketcn. The following types of 
noise were considered: 

(1) brownian-type motion - Section 3.3.1, 

(2) rotation - Section 3.3.2, 

(3) rescaling - Section 3.3.3, and 

(4) random addition(s)/deletion(s) - Section 3.3.4. 

Eaca section presents the expected results first and 


then evaluates the actual results in terms of those 


expectations. 


3.3.1. Erownian-type moticn noise efiects 


Four mutated copies of MAP2 were producea by msoving 
each point one tnit ( 3.1254 of the sketch size ), two units 
( 6.25% ), four units ( 12.5% ), and eight units ( 25% ), 
respectively, in one ot eight randomly selected directions - 
labelled 0 through 7 - as illiustcatea in Figure 12. Note: 
“a indicates the location from which a point is moved. 

Table 3 defines the relative point pair movement 
possibilities. Each possibility is one of six events listed 


below. One point ot an arbitrary pair moves in_ the 
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Figure 12. Directions ot possible point movement. 
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Table 3. Relative point movements. 


direction indicated by the column headings. The other point 


woves aS indicatea by the row headings. 


The six possible events define 


the 
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Length-Factor-Change( LFC )-in-distance petween two 
arbitrary points as a result of the unit movement of each. 
Those events and freguencies of occurrence are: 

(1) E1- 0 unit LFC, 38 times ( out of 64 ), 

(2) E2—- 1 unit LFC, 16 times, 

(3) £3 - 2 units LFC, 12 times, 

(4) E4 — SQRI(2) units LFC, 8 tiues, 

(5) ES - SQRT(5) units LFC, 16 times, and 

(6) Eo — 2¥*SQKT(2) units LFC, 4 times. 

The percent of jrid size change of the six events 


and various Brownian pcvement distances are listea in Taple 
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Table 4. Percents of grid size for various LFC's/distances. 


4; if each point moves distance D, then the distance change 
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is D * LFC. 

‘If a sketch is matched against itself, okviously a 
concentration of vaiue N (N-1) / 2 ( Maxinmun Expected 
Concentration ) Will occur at cCCD( 0, O );however, other 
extraneous concentrations may also occur. When a Sketch is 
Matched against a copy which nas undergone Brownian 
movement, one may expect the following concentrations: 

(1) (© - MEC#1 at (0,0) 

(2) - MEC*1/8 at (0,0) + NXM*1/4 at (#-1,4#-1) ¢ 

NXM*1/8 at (#-SQRT (2) ,+-SQRT (2)) 

(3) 2 - MEC*1 at (0,0) 

(4) 4 - MEC*1 at (0,0) 

(5) 8 - MEC*1 at (0,0). 
where: 

(1) MEC = N (N-1) / 2, 

(2) (+-#,+-#) 1S interpreted as: 

(a) CCD (+#,0), 
(bp) CCD(-#,0), 
(c) CCD(0,+#), and 
(d) CCD(0,-#). 

NOTE: lack of a “t-“" means that only one element, 

CCD (#,#), is desiynated. 

The atove expectations are based on the assumption 
that the pseudo-random number generator will produce a 
uniform distribution, that is, all eight directions of 


movement are eygually likely; however, the expectations are 
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Subject to the usual frobablistic interpretations. Hence, 
one should expect that only unit distance Brownian movement 
will yield any significant cluster of concentratious. This 


is illustrated oy Figure 1t3a-d. 


3.3.2. Rotational noise effects. 


Six rotated versions of MAP2 were produced 
cotations of: 

(1) 5 degrees, 

(2) 10 degrees, 

(3) 15 degrees, 

(4) 20 aegjrees, 

(5) 25 degrees, 

(6) 30 degrees. 
The point closest to an "eyeball estimate" of the centroid 
of MAP2 was chosen tc be the origin of rotation in ail 
cases. 


Effects of rotational noise can be predicted in the 


following manner. Censider two points which are unit 


distance apart. If the imaginary line segment connecting 
the points is duplicated and rotated aS above, one can 
compute the iength of the third side of the imaginary 
triangles thus produced. Recall that mismatch tolerance, 


defined in terms of grid size, limits how long the imaginary 
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13a. MAP2 vs. MAP2.B1 with toierance 0.05. 
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13b. MAP2 vs. MAP2.B2 with tolerance 0.05. 
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cococoeoooooocoo0oocoocococoxcqcococoocooceocownwo000Ccq000c00090 
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eoooo0o0ococococ0cccoxnmcoocococoococcocccocoaccccc900c0cccoo0oco0o 


13d. MAP2 vs. MAP2.B8 with tolerance 0.05. 
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joooocoosocoooccocoooooc0ococoocooaonocococeccooqc0coocococoooccoocoso 
ooooacococoogcccoococHoococecoescoocsccsccoaccoeocoCcecsooceg 
coccesosoocsocecococecseoscocecooocecooaccoccocoqgosaooccsa 
eoscoococoeocscocooococoo0oocoococoooccecococoocoscoocoocococococ0ooco 
eceoocsaoscscooonwOsSSD00COFOCOOCOSCOSCOCOCONDOCOSCOSCCOOCOCSCOONDN (7) 
ocooscocecoogcoootoocococcococcaocsooooccosoocses | 
eooc0oosog0coesooscooooocoscococcoocoo7eocn0 osc 00c0o0000 an 
eocoscs0oco0cocosec0cscoocoocooocoossq0q0cococcococoqo0ccoocesc el 
ecocoscesascocecococecesooocococococecoececeaescocsa ™ 
SSaocoocscrro0oscocoooooocoosgoococoqnocodeq0 S000 0o-a0 
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third side may be before cauSing rejections as a possible 


Match. Percents ot grid size Chanye are then computable 


co --7y--——_—_------- ----- 


PR ee ere ee eee 
JDegrees |Length or 3rd side |Grid size percent | 


---—---- ~~ +--———- — 
1 5 | 0.06723 1 3.723 
‘--————- —- eo eae SS a ee A ce: —a 

| 0m { 0.17431 { 17.431 


j——-—-------------------—--—----__--- 
‘a pee 0. 26105 26. 105 


{---—---+--—-------—--—---~--+-— 
ae ee 0.43287 


j---—-------------—------ + 
| 25 | 0.51763 
— 


Table 5. Percents of grid size for various rotations. 


from these as tabulated in Tavle 5. Noting the percentages, 


One would expect to achieve significant concentrations only 


for a rotation of 5 deyrees. Fiyure 14a-f shows this to be 


true. 


3.3.3. hkescaliny noise eftects. 


Five rescaled covies of MAP2 were generated 
rescaling point coordinates py the following ftactors: 
(1) 1.01 ( NOTE: this produced no change 
coordinate values ), 


(2) 1602, 


cs2000000cC0CCCCOCOCCCCNCCCCCCOCOMoocccoco9c00c0ccc0oCCcCcCCCCcCCCCCSD 
e27e0000c00CcCCKCCCCCCOCCCCOCCCCCCCCCCCOCOCCCCOCOCCCSCCCCOSCOCOS 
evooo0co0occoccocoocooccoocococoococococcocCcCcoCcCcCCCcCOoOCCCOCOCOSCCOOoCOoCCSO 
escocooooocooocoocoococc0cooCooOnoOGCCCOCOCOCCCCCCCCCCSCCCCCCCSCSCSCCS 
ev,se0c000c0c0occccCcCCCCCCOCCCCCCCOCCCCCCCCCCCCCOCOCCCCCCOCCCSN 
aesc00000c0cc00c0CcCccccoocoococcooccooooccocoo9ocooCCCCCCCCCOoCCSoOoCOCoCS 
escoocooococococoCcCoCCCCCCCCCCCCoCOCCCCCCOCCCCCCCCCCCCCCCCCOCSO 
CSSCDD0DDDDODOCDOOCOOOODDOOOCOCOOOOOCOOC OOD OOODDODDDOODD0N0000 
evoeo0oocoocococcCCCcCCCCCCCCCCOoOHnOoCOCOQCCCCOCOCCCCCOCOoCCCOCOSCSCSCSO 
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eseco0o0000c000coocoocoCoCcCCCCOHOCCCOCOCCCCCCCCCCCCCOCOCCCOCCCCCCSCSO 
e2se000000000CCCCCOoOHno0000 COCCOOHOCCCCCOCCCCCOCCCCOCCOCOCOCOCSO 
evs0000000cCogooccCoCCCCCCCOCCCCOCOCCCOMOCCCCCCOCOOCCCOCOCCCCCOCSO 
ee200000cCcCCoCoCCCCCOCCCCCCCOCOCCOCOOCCCOCCOKCCCCCCCCCCCCCGCCCOSCOO 
ec9co0oo0oo0ccCoKoCCoCCCCCCCCCONGCAGCCOCOCOHAGCCCCCACCCCOCCCCCCCCCCO 
ese00o0o00000cCcCcCCCCCCCoOHCOHOOHOOHOOKOCCCCOCCCCCCCCCCCCOCOCSO 
esqoo0000cocc0CoCcCoCoCCCCCCOoOAOOnHAGD NCCC CCCCCCCCCCOCSCGOCSCOCCOCOCOCOCOCOCO 
ese0o00c00CcCcCcCCoOHoCCOCOCOnOCOCCCCCCOCCOOCOCCCCCCCCCCCCCCOCCCOCCSO 
PSCSCDCOOCOOCNCCOCCOCCOOOMNOOOHOCOCOCOOCOCOOCOOOHOCOCCOCCOCCCOCOCOCOOCCSD 
es70q00000000gCOcCCCCOCOCOCCCOCOCOMOn~CCNCOGOOCCCOCNACGCOCOCCCOCOCCCC0CNO 
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eos2eo00000cCCCCcCCcCCCCCOHOOCCOOCOCOCCCOCCCOSCCCCOCNCGOOCCCCCCCCCCRCSO 
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er,cocoocooccocoCcCocCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCOCOCCOCS 
escoccococcooocCoCoCcCCoCCCCCCCOCCCCCOCOCCCCOCCCOCOCCCCOCCOCCCOCOCCO 
2920000000 OOCOCOCOCOCOCOCOCOOCOOMoocccCcCCCoOSGCCOCCDNDCOCCCCCCSCSO 


MAP2.R5 with tolerance 0.05. 


14a. MAP2 vs. 
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MAP2.R10 with tolerance 0.05. 


MAP2 vs. 


Pigure 14b. 
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coooogoo0ooocoocococcoocococCcCcoMocococoooc0cocococ0coccoccccoccCoCscScS 
ee0000c0cocoCocCoCCCCCCCCCCCCcCNOOCCCOCOCCCOCOCOOCOCOCCCOCOCCCCOCOSCOO 
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Pigure 14c. MAP2 vs. MAP2.R15 with tolerance 0.05. 


Te 


: 
; 
3 


ceocoooocoooecoococoocccocCcCcCCCoOFoocooooco0ccocccCcCcCoCcoCCCOCCOoOCOo 
ceooo0oo0o0o0cocooooc9ecCCCCCCGCCCCCCCCCCCCGOCCCCCCCCCCCCCCOCOCO 
cecooocooccooccoocooccoccCCCCCCCCCCCCCCCOCCCCCCCCCCCCCCOCOSO 
coocoocooecooooco0ocCCCCCCCCCCCCCCOCCOCCOCCOCCCCCCCCNCCCOCCOCCSO 
seoo0c00cc0cCcC CCC CCCCCCCCCCCCCCCCCOCCCOCOCOCOCCOCCOCONCCOCCCOCOCOCSCOCO 
coooecoooccoococccoococCcoccocoCoCCCOCCCCCOCOCCCaCCCoCCOOCOCCCCCOCCOCOOCOOoOO 
ceosccooooocoocoococccococoCcCCCoCCoCOCCCCOCCOCCOCCCOCCOCCCOCCOCOSC COCO 
coocoo0gc00o9coCccCoCno0CCCCCCCCCCNOCCCOCCCCOCCOCCCCCCCNCCCCCCOCOCOCONO 
SCOKOKDODDODOOCODOOOOCOOOOOOOOCOOCOOOHnD OOO ODO OOOO OO OOOO D00ND 
ceooooeooceoocooccocoCcocCoCCCCCCCCCCCOCCCOOnwocCGCCOCONQCCCCOCOCCCOOCO 
ceogoc00cocococoococococoocococooococccCcooOw~ocoo0cCCCCCCCCCOCCCCCOCOCNO 
coooo0oooooococooccc9ocCOnC0CgCCcCCOCOOxoooCCOCOCCCCCOCCCCCOCCoCOCOoOOCS 
ceoooo0o0oc0eco0cococcoooccCcccoCCcCoCCCCCCCCOMmoecoCooCcoooCCOCCOoCCOoCSO 
soooo0o0o0o0o0oc0o0occoc0cocoqc0cccocoNncooCoCCcCocCooCCCoCcCoCCOCOOCCCOoScS 
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Pigure 14d. MAP2 vs. 4AP2.R20 with tolerance 0.05. 
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Figure t4e. MAP2 vs. MAP2.R25 with tolerance 0.05. 
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(3) 1.05, 

(4) 1.10, and 

(5) 1.20. 

Matching results for the above cases are also rather 
predictable. Since a mismatca tolerance of 0.05 (5% ) is 
allowed, one would expect the cases (1), (2), and (3) to 
produce significant concentrations. Figure 15a-c shows this 
to be true. One would also expect cases (4) and (5) to give 
insignificant responses; however, Figure 15d-e show this to 
be true only for case (5). The reason for this anomaly is 
roundoff error which can result in some points being up to 
SQRT(2) units ( 4.419% ) closer than they should be. Hence, 


the "unexpected" significant response. 


3.3.4. Random additionsydeletions noise effects. 


Four mutated versions of MAP2 were produced by 
randomly selecting one, two, four, and eight points for 
replacement by randomly generated points. Results will be 
good in all four cases because in each case there will be 
N (N~1) / 2 ( where N is the number of unreplaced points ) 
point pairs which remain exactly the same. 

The four cases should produce concentrations of 153, 
136, 105, and “55, respectively, at CCD( 0, 0 ). Any 


deviations would be in the form of higher valued 
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Figure 15a. MAP2 vs. MAP2.S01 with tolerance 0.05. 
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Pigure 15b. MAP2 vs. MAP2.S02 with tolerance 0.05. 
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Figure 15c. MAP2 vs. MAP2.S05 with tolerance 0.05. 
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Figure 15d. MAP2 vs. MAP2.S10 with tolerance 0.05. 
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concentrations due to contributioas from the candoaly 


generated points. Figure 16a-d illustrates the accuracy of 


these expectations. 
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Figure 16b. MAP2 vs. MAP2.AD2 with tolerance 0.05. 
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Figure 6c. MAP2 vs. MAP2.AD4 with tolerance 0.05. 
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Piguce 16d. MAP2 vs. MAP2.AD1 with tolerance 0.05. 


68 


3-4. Kandom point pattern matching 


Five random point patterns were generated providing 
ten pattern combinaticns. In essentially all cases only 
very widely scattered “concentrations” of values one(1) or 
two(2) were produced. In one instance two occurences of 
value three(3) and one cccurrence of value four(4) resulted. 

This fact tends to support the hypothesis that 
"fortuitous/chance" significant concentrations will not be 


Froducec. 
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CHAPTER 4 


ENHANCED MAICHING ALGORITHS 


This chapter is concerned with describing the 
various enhancements - some implemented, scme not - to the 
basic PITM/BA algorithm, These enhancements are designed to 
reduce the etfects of the Brownian, rotational, and 
cescaling noise described in Chapter 3 as well as produce 
more of the information that would be required to accomplish 
registration. 

Section 4.1 describes why the primary enhancements 
improve the matching cargavilities of PIM/sBA. Section 4.2 
shows the improved results for the above types of noise and 
suggests/shows specific ways to further reduce noise 


effects. 
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4.1. Basic algoriths enhancements 


The primary enhancement tc PTM/BA was the use of 
additional match criteria - junction type ( Waltz [W1) ) 
testing. Recalling that each point P(i) is represented by 
the three-tuple ( xi), yi). je(i) )- the following 
requirement was added: 

Let P1{i),P1(4) and P2(i*),P2(j") be two 

arbitrary points frcu SKETCH! and SKETCH2, 

respectively. Then: 

(1) If Pi. jt(i) = P2.4t(i") AND PI. §t(jp = 

P2.jt(j'), or 
(2) Tf P1.4t (i) = P2.jt(j*) AND P1,jt(j) = 
P2.4t(i') 
the pair is considered to be a pcssible match. 
Use of this additional contextual information will reduce 
the number of point fairs that need to be compared by the 


"BA" portion of the enhanced algoritha, PTM/EA, thus 


reducing the running time. 

Furthermore, one may increase the sismatch tolerance 
because if junction types match, then one is willing to 
tolerate a less precise length patch as the point pairs are 
more likely to represent the same features. As will be 
seen, increasing the sismatch tclerance to 0.15 results in 
approximately the same number of entries into the CCD matrix 


by ailowing a more sloppy, while at the same time sore 
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confident, match of pcint pairs. This aight allow a map 
which does not very accurately depict "ground truth" to be 
successfully matched to ah aerial photograph of the area. 


Other ( specific ) enhancements will be discussed 


under the types of noise they are designed to rceauce. 


TNT TTT 


72 


4.2.1. Reducing Brownian motion noise effects 


As can be seen in Figure 17a-d, the general 
enhancements descrited above yield a significant improvment 
in response. While PIM/BA suffered an 85% reduction in the 
concentration strengths clustered about CCD( 0, 0) for one 
unit movement, PTM/EA suffered only a 36% reduction. In the 
case of two units’ movement, PIM/BA produced no discernible 
clusters, PTM/EA yielded a scattered cluster centered about 
ccD( 9, 0); this suggests an additional enhancement ( not 
implemented ) as described belcw. Pour and eight units! 
movement produced no siginficant concentrations for either 
version. 

If cne expects Brownian mction of one/two units, a 
cluster detection template such as Figure 18 could be 


applied to detect the scattered clusters in the CCD matrix 


( see Rosenfeld and Kak (82) ). 


4.2.2. Reducing rotaticnal noise effects 


Results for rotational ncise are similar to those 


for Brownian noise - see Figure 1Sa-f. In the case of a 5 
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Figure 17a. MAP2 vs. MAP2.B1 with tolerance 0.15 
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Pigure 17b. MAP2 vs. MAP2.B2 with tolerance 0.15. 
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Figure 17c. MAP2 vs. MAP2.B4 with tolerance 0.15. 
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Pigure 17d. MAP2 vs. MAP2.B8 with tolerance 0.15. 
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Pigure 18. Cluster detection template. 


degree rotation PTM/BA suffered an 84% reduction in 
concentration strengths whereas ETM/EA suffered only a 17% 
reduction. For 10 degrees cf rotation no significant 


concentrations were prcduced by PIM/BA while PTM/EA yielded 


a scattered cluster similar to the one described above for 
Brownian sovegsent of two units. Hence, cluster detection 


teaplates weuld prove cuseful here also. 


ee 


As previously mentioned, if junction types satch, 
two point pairs are likely to represent the same features. 
Given this assumption, one can determine, fairly accurately, 
the relative rotaticn of one sketch with respect to the 


other by means of a “rotational histcgraa". 


The rotational histogras is computed as follows. If 


junction types match, then ccapute the difference in slopes 


of the ipaginary line segments determined by the point 


pairs. Divide the difference by the radian equivalent of 5 
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Figure 19a. 
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degrees. Use the quotient as a bin index and increment that 
bints freguency count by one(1). 

Using a bin width of 5 degrees was basea on the 
ability of PTM/BA to produce significant concentrations in 
the case of 5 degrees of rotation as well aS not wanting too 
Many contributions to each pin. Selection of the maximun 
valued kin would iundicate the relative rotation ( see Table 
6a-f for examples ). After such determination, one of the 
sketches could be apeccpriately rotated and then PIM/EA 
could ke rerub in hopes of achieving higher-valued, 


less-scattered concentrations in the CCD matrix. 


4.2.3. Reducing rescaliny noise effects 


As mentioned in Section 3.3.3, matching for 
rescaling cases tends tc wreak down after the scaling 
factors exceed the mismatch tolerance. This’ behavior did 
not change in ETM/EA as illustrated in Figure 20a-e. 

In a manner Similac to that described in Section 
4.3, one could compute a "rescaling histogram", selecting 
the maximum valued bin as indicating the relative rescaling 


factors, tescaie, and then rematch. 
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Table 6a. Rotational histogram for rotation of 5 degrees. 
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Table 6b. Rotational histogram for rotation of 10 degrees. 
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Table 6c. Rotational histogram for rotation of 15 degrees. 
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Table 6d. Rotational histogras for rotation of 20 degrees. 
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Figute 20a. MAP2 vs. MAP2.S01 with tolerance 0.15. 
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Figure 20b. MAP2 vs. MAP2.S02 with tolerance 0.15. 
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Figure 20c. MAP2 vs. MAP2.S05 with tolerance 0.15. 
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Figure 20d. MAP2 vs. MAP2.S10 with tolerance 0.15. 
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Figure 20e. MAP2 WS. MAP2.S15 with tolerance 0.15. 
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CONCLUSIONS 


This chapter summarizes the results reported in 


Chapters 2-4 and suggests areas for additional research. 
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5-1. Recommendations 


while the relative rotation between sketches can de 
determined, a method fcr determininy the point about which 
rotation has occurred needs to be developed. The author, 
for “usepoSes of this research, chose to assume rotation 
would have been apcut the eyeball estimate of the point 
closest to tie centroid. 

Both versions of the algorithm, PTM/BA and PTM/EA, 
are order (N**4). An apfreach which might be taken could be 
a kind of state space search. It the alyorithbm were to 
monitor the CCD matrix and it noticed a build-up of a 
wiguiticant ( cluster cf) concentration, then the “path" 
defined by that concentration could be followed; that is, 
only those }airs that would contribute to the same 
concentration would ke matched. It a “sufficient 
propoction of the wmaxinua humper of fossible matches 
occurred, then the alyorithm would terminate with a 
successful watching. If such a heuristic, state space 
search sere applied to two matchiny sketches, actual cunning 
time should be considerably less than order N#*#*4, 

Additionally, the sketch data-point files could be 
sorted on junction type values from most to least 
distinctive/comple x. Atter sorting, points wita the sane, 
most distinctive junction types would be matched. If the 


monitor noticed a significant concentration, the path 
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defined by that concentration could be followed as above. 
Otherwise, points with nearly the same junction types could 
be matched; then similar junction types if no matches 
discovered and so torth until ain the caSe ot non-matching 
sketches all points wotlu have been watched. 

One last retinement could function as _ follows. 
Detine some minimal conccntration thresnold value. If this 
threshold were not exceeded after some appropriate 
proportion of points hau been matcneu, then the algoritha 
woula terminate with the indication that the sketches did 


not match. 
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5.2. Summary 


The goal of this thesis was to build a foundation 
for CVL*s proposed map-guided segmentation system. Chapter 2 
defined a simple, brute force sketch matching algorithm 
based on the relative positions of the ( feature ) points. 
A simple example illustrated the methodology. 

Chapter 3 described how a suitable mismatch 
tolerance waS selected and dealt with the effects of varying 
amcunts of various types of noise ( Brownian motion, 
rotatior, rescaling, and randow point replacement ). 
Presented in Chapter 4 were various enhancemeats/suggestions 
to ceduce the cunning time and the effects cf noise. 
Addition of junction type label cneckiny ceduced the number 
of -airs that had to be matched "in full" and allowed a 
larger mismatcn tolerance. Methods of determining the 


relative rotation and scale between sketches wete presented. 
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